Feat：somark plugins #2487

Soul-Code · 2026-01-26T05:44:38Z

Add Somark tool plugin for converting documents (PDFs, images, etc.) into structured Markdown or JSON format using the Somark API.

Features:

Document extraction with OXR (Optical Everything Recognition) algorithm
Support for multiple file formats (PDF, PNG, JPG, etc.)
Configurable API endpoint and authentication
Max file size: 50MB/50 pages

Related Issues or Context

This PR contains Changes to Non-Plugin

Documentation
Other

This PR contains Changes to Non-LLM Models Plugin

I have Run Comprehensive Tests Relevant to My Changes

This PR contains Changes to LLM Models Plugin

My Changes Affect Message Flow Handling (System Messages and User→Assistant Turn-Taking)

My Changes Affect Tool Interaction Flow (Multi-Round Usage and Output Handling, for both Agent App and Agent Node)

My Changes Affect Multimodal Input Handling (Images, PDFs, Audio, Video, etc.)

My Changes Affect Multimodal Output Generation (Images, Audio, Video, etc.)

My Changes Affect Structured Output Format (JSON, XML, etc.)

My Changes Affect Token Consumption Metrics

My Changes Affect Other LLM Functionalities (Reasoning Process, Grounding, Prompt Caching, etc.)

Other Changes (Add New Models, Fix Model Parameters etc.)

Version Control (Any Changes to the Plugin Will Require Bumping the Version)

I have Bumped Up the Version in Manifest.yaml (Top-Level Version Field, Not in Meta Section)

Dify Plugin SDK Version

I have Ensured dify_plugin>=0.3.0,<0.6.0 is in requirements.txt (SDK docs)

Environment Verification (If Any Code Changes)

Local Deployment Environment

Dify Version is: , I have Tested My Changes on Local Deployment Dify with a Clean Environment That Matches the Production Configuration.

SaaS Environment

I have Tested My Changes on cloud.dify.ai with a Clean Environment That Matches the Production Configuration

gemini-code-assist · 2026-01-26T05:44:56Z

Summary of Changes

Hello @Soul-Code, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request integrates the Somark DocAI platform as a new plugin within Dify, significantly enhancing its document processing capabilities. Users can now convert diverse document types, such as PDFs and images, into structured Markdown or JSON outputs. This integration provides advanced document understanding and data extraction, facilitating the incorporation of document content into LLM training, RAG systems, and intelligent agents.

Highlights

New Somark Plugin Integration: This pull request introduces a new Dify plugin for Somark, a DocAI platform designed for advanced document processing.
Document AI Capabilities: The plugin enables the conversion of various document types, including PDFs and images, into structured Markdown or JSON formats. It leverages Somark's proprietary 'OXR' (Optical Everything Recognition) algorithm for precise content extraction.
Seamless Workflow Integration: A new 'Extract Document' tool is provided, allowing users to easily integrate Somark's document parsing capabilities into their Dify workflows. Configuration requires a Somark API Key for authentication.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request adds a new somark plugin, which is a great addition. The code is well-structured, but I've found a few issues that should be addressed before merging. These include a critical bug in API URL construction, missing credential validation, potential runtime errors, and several inconsistencies in metadata and documentation. Addressing these points will improve the plugin's robustness and user experience.

tools/somark/provider/somark.yaml

tools/somark/tools/extract.py

tools/somark/provider/somark.py

tools/somark/tools/extract.py

tools/somark/manifest.yaml

tools/somark/README.md

tools/somark/tools/extract.py

tools/somark/tools/extract.yaml

crazywoola

See comments.

tools/somark/manifest.yaml

Add Somark tool plugin for converting documents (PDFs, images, etc.) into structured Markdown or JSON format using the Somark API. Features: - Document extraction with OXR (Optical Everything Recognition) algorithm - Support for multiple file formats (PDF, PNG, JPG, etc.) - Configurable API endpoint and authentication - Max file size: 50MB/50 pages

1. Improve error handling and type hinting in extract tool 2. Add credential validation in provider 3. Ensure icon resource exists

Soul-Code · 2026-01-30T06:05:09Z

Hi @crazywoola,

Thanks for the review! I've addressed all your comments. Please take another look when you have a chance.

dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label Jan 26, 2026

dosubot bot added the enhancement New feature or request label Jan 26, 2026

gemini-code-assist bot reviewed Jan 26, 2026

View reviewed changes

crazywoola reviewed Jan 26, 2026

View reviewed changes

tools/somark/manifest.yaml Outdated Show resolved Hide resolved

Soul-Code had a problem deploying to tools/somark January 30, 2026 02:13 — with GitHub Actions Failure

Soul-Code had a problem deploying to tools/somark January 30, 2026 03:09 — with GitHub Actions Failure

Soul-Code had a problem deploying to tools/somark January 30, 2026 03:14 — with GitHub Actions Failure

Soul-Code had a problem deploying to tools/somark January 30, 2026 03:18 — with GitHub Actions Failure

Soul-Code temporarily deployed to tools/somark January 30, 2026 03:52 — with GitHub Actions Inactive

Soul-Code temporarily deployed to tools/somark January 30, 2026 03:57 — with GitHub Actions Inactive

Soul-Code temporarily deployed to tools/somark January 30, 2026 05:39 — with GitHub Actions Inactive

Soul-Code temporarily deployed to tools/somark January 30, 2026 05:51 — with GitHub Actions Inactive

Soul-Code added 11 commits January 30, 2026 14:02

fix: revert dify_plugin version to 0.3.0 for compatibility

34ff639

fix(somark): enhance code robustness and credential validation

322af55

1. Improve error handling and type hinting in extract tool 2. Add credential validation in provider 3. Ensure icon resource exists

fix(somark): update author to langgenius in manifest

2da9867

fix(somark): add missing pyproject.toml configuration

dd2f5b2

fix(somark): add missing created_at field to manifest

9da9fcb

chore: add uv.lock for somark plugin

0710801

fix: correct extract.yaml structure and missing source config

e1c300b

fix(somark): update base url and endpoint construction

11c8533

docs(somark): rename asset files to meaningful names

32d32f7

docs(somark): update llm description in extract tool

ca66061

Soul-Code force-pushed the feat：somark-plugins branch from c46703a to ca66061 Compare January 30, 2026 06:02

Soul-Code deployed to tools/somark January 30, 2026 06:02 — with GitHub Actions Active

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat：somark plugins #2487

Feat：somark plugins #2487

Uh oh!

Soul-Code commented Jan 26, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Jan 26, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

crazywoola left a comment

Uh oh!

Uh oh!

Soul-Code commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feat：somark plugins #2487

Are you sure you want to change the base?

Feat：somark plugins #2487

Uh oh!

Conversation

Soul-Code commented Jan 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related Issues or Context

This PR contains Changes to Non-Plugin

This PR contains Changes to Non-LLM Models Plugin

This PR contains Changes to LLM Models Plugin

Version Control (Any Changes to the Plugin Will Require Bumping the Version)

Dify Plugin SDK Version

Environment Verification (If Any Code Changes)

Local Deployment Environment

SaaS Environment

Uh oh!

gemini-code-assist bot commented Jan 26, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

crazywoola left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Soul-Code commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Soul-Code commented Jan 26, 2026 •

edited

Loading